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Abstract. This note is concerned with concentration inequalities for extrema 
of stationary Gaussian processes. It provides non-asymptotic tail inequalities 
which fully reflect the fluctuation rate, and as such improve upon standard 
Gaussian concentration. The arguments rely on the hypercontractive approach 
developed by Chatterjee for superconcentration variance bounds. Some statis¬ 
tical illustrations complete the exposition. 


1. Introduction 

1.1. Convergence of extremes. As an introduction, we recall some classical facts 
about weak convergence of extrema of stationary Gaussian sequences and processes. 

1.1.1. Stationary Gaussian sequences. Let be a centered stationary Gauss¬ 

ian sequence such that E = 1, * > 0, with covariance Cov{Xi,Xj) = 

b J ^ Oj where (j) : N —>■ R. An extensive study has been developed in this 
setting towards the asymptotic behaviour of the maximum Mn = maxi=i^,,._ji 
(Berman, Mittal, Pickands... cf. e.g. [12]). A sample result is the following theorem 
(see [12]). 

Theorem 1. Let {Xi)i>o be a stationary Gaussian sequence with covariance func¬ 
tion (j) such that 4>{n) logn —^0 as n —^ oo. Then 

a„(M„ - bn) —)> G 

n—^oo 

in distribution where a„ = (21ogn)^/^, 

bn = (21ogn)^/^- I(21ogn)“^/^(loglogu-f logdTr) 
and the random variable G has a Gumbel distribution (for all a; G K, P(G < x) = 

1.1.2. Stationary Gaussian processes. Let {Xt)t>Q be a centered stationary Gauss¬ 
ian process such that E [Xf] = 1, t > 0, with covariance function Cov{Xs, Xt) = 
(j)[\t — sj), for t, s > 0, where (j) : [0, -l-oo) —>■ R. Consider two behaviors, as t 0, 
of the covariance function (f : 

(1.1) (j){t) = l-^-fo(f2) 

( 1 . 2 ) cfit) = l-C\tr + o{\tr) 
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The first case ensures that Xt is differentiable and A 2 = —(j)''{0) is a spectral 
moment, whereas the second case concerns non-differentiable (but continuous) pro¬ 
cesses (such as the Ornstein-Uhlenbeck process). For more details about this topic, 
see [12]. For any T > 0, set Mt = sup^^jg -j^j Xt- 

Theorem 2. Let {Xt)t>o be a stationary Gaussian process such that (j)(t) logt —> 0 
as t —>■ 00 . Then 

aj'(Mx — bj^) —y G 

T-s-oo 

in distribution where gt = (21ogr)^/^, bx depends on the hypothesis (1.1) or (1.2) 
and G has a Gumbel distribution. 

The aim of this note is to quantify the preceding asymptotic statements into 
sharp concentration inequalities fully reflecting the fluctuation rate of the maxi¬ 
mum (respectively the supremum). Such variance bounds with the correct scale 
were first obtained in this context by Chatterjee in [7], as part of the superconcen¬ 
tration phenomenon. The results presented here strengthen the variance bounds 
into exponential tail inequalities. One main result is the following statement. 

Theorem 3. Let {Xi)i>Q be a centered stationary Gaussian sequence with covari¬ 
ance function fi. Assume that (j) is non-increasing and satisfies ^i(l) <1/2. Then, 
there exists a = € (0,1) and c = c{<fi, a) > 0 such that for all n > 2, 

(1.3) P(|M„ - E[M„]| >t)< 6e-=*/V“a’'00“),i/iogn)^ ^ > q. 

Remark. Under the hypothesis of Theorem 1, max((/(n“), 1/logn) = 1/logn for 
n large enough, which is exactly the fluctuation rate. Observe furthermore that 
integrating (1.3) recovers the variance bounds of [7]. 

It is important to compare Theorem 3 to classical Gaussian concentration (see 
e.g. [13]) which typically produces 

(1.4) P(1M„-E[M„] 1 > f) < 2e-‘'/2^ f > 0. 

While of Gaussian tail, such bounds do not reflect the fluctuations of the extremum 
M„ of Theorem 1. Moreover, with respect to this Gaussian bound. Theorem 3 
actually provides the correct tail behavior of the maximum M„ in the form of a 
superconcentration inequality, in accordance with the fluctuation result and the 
limiting Gumbel distribution (since P(G > a;) ~ as x —>■ 00 ). Let us also 
emphasizes that Theorem 3 covers the classical independent case, when all the 
XiS are independent, by taking </> = 0 and provides the following concentration 
inequality which will be useful in the last section, 

(1.5) P(lM„-E[M„]l>f)<6e-°*^'^, t > 0. 

Observe in addition that Theorem 3 expresses a concentration property of the 
maximum around its mean whereas, in the regime of the convergence of extremes 
of Theorem 1, the centerings are produced by explicit values Actually, up to 
numerical constants, the same inequalities hold true around the constant instead 
of the mean. To this task, it is enough to prove that sup„E [ja„(M„ — 6„)j] < 00 . 
Set Zn = an{Mn — bn). Letting be an independent copy of M„, set similarly 
Z'n = an{M'^ — bn). Now, integrating (1.3), sup„E[a„lM„ — E[M„]j] < 00 . Hence 
sup„ 'R\\Zn — Z’nW <00 from which it easily follows that sup„ E [jZ„[] < 00 . 

The next statement is the analogue of Theorem 3 for stationary Gaussian pro¬ 
cesses, suitably quantifying Theorem 2. It is presented in the more general context 
of a centered stationary Gaussian field (At)jgRd such that E [A/] = 1, t € with 
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covariance function (j). According to [7], very little seems actually to be known on 
the asymptotic fluctuations of the supremum of stationary Gaussian processes in¬ 
dexed by when the dimension d is greater than two. Some recent specific results 
are available for Gaussian fields with logarithmic correlation (see e.g. [1, 10, 15, 9] 
and [11] for an overview on the subject). However, extending similarly the variance 
bounds of [7] , we obtain a concentration inequality for the supremum of a Gaussian 
field over a subset A of The case d = 1 thus covers Theorem 2. 

Theorem 4. Let {Xt)t^g^d be a stationary Gaussian Euclidean field with covariance 
function cj). Assume that t i—>■ (fft) is non-increasing and ^(1) < 1/2. If A is a 
subset ofMf, denote by N{A) = N{A, 1) the minimal number of balls with radius 1 
needed to cover A. Set M{A) = sup^g^ Xg- Then, there exist C = C[4>, d) > 0 and 
c = c{(j), d) > 0 only depending on f) and d such that, for all A C with N{A) > 1, 

¥{\Ma-E [Ma] \>t)< , t>0, 

where 

Kn(A) = max {(j) {N{A)^) , 1/ log A^(A)) . 

Remark. When d = 1 and A = [0,T] for some T > 0, N{A) = T/2 and under the 
hypotheses of Theorem 2, Kjq^A) = l/logT for T large enough. 

The proofs of Theorem 3 and 4 are based on the hypercontractive approach 
developed by Ghatterjee [7] towards variance bounds. The task will be to adapt 
the argument to reach exponential tail inequalities at the correct fluctuation regime. 
This will be achieved via the corresponding variance bound at the level of Laplace 
transforms. 

The paper is organized as follows. In the next section, we present the semigroup 
tools used to prove the main results. The third section is devoted to the proof of 
Theorems 3 and 4. In Section 4, we present some illustrations to further Gaussian 
models. In the final section, we present an illustration in statistical testing. 

2. Framework and tools 

2.1. Notation. For any n > I, denote by A = (Ai,..., A„) a centered Gaussian 
vector, with maximum M„ = maxj=i^,,,_„ A^. Set I = argmax^^j^ n^i- 

The main technical part of this work is provided by the following extension of 
Ghatterjee’s approach [7] adapted to exponential concentration bounds. The very 
basis of the argument is hypercontractivity of the Ornstein-Uhlenbeck semigroup 
together with the exponential version of Poincare’s inequality (cf. [5, 4, 7]). 

Theorem 5. Let X = (Ai,..., A„) be a centered Gaussian vector with covariance 
matrix T. Assume that for some rg > 0, there exists a non trivial covering C{ro) of 
{1,..., n} verifying the following properties: 

• for all i,j € {1,..., n} such that F^ > rp, there exists D G CItq) such that 
hj G D; 


• there exists C > 1 such that, a.s., J^n^Ciro) Si C. 


Let p(ro) = max£,gc(ro) G D). 
Then, for every 0 G R, 


( 2 . 1 ) 




1 

log (l/p(ro)) 
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In particular, 

P(|M„ - E[M„] I > t) < t > 0, 

where Krg = max (tq, iog(i/p(r-o)) ) c> 0 is a universal constant. 

Remark. By monotone convergence, we can obtain the same result for a supremum 
instead of a maximum. This fact will be useful to obtain an application of 
Theorem 5 for a Gaussian Euclidean field. 


Proof. As announced, the scheme of proof follows [7]. The starting point is the 
representation formula 

pCO 

(2.2) Var(/)= / e-‘E [V/• PtV/] dt 

Jo 

for the variance of a (smooth) function / : K" —R along the Ornstein-Uhlenbeck 
semigroup (Tt)t>o' General references on the Ornstein-Uhlenbeck semigroup and 
more general Markov semigroups, as well as such semigroup interpolation formulas 
are [5, 4]. 

Following [7], given a centered Gaussian vector X with covariance matrix T = 
{^ij)l<i,j<n, apply (2.2) to (a smooth approximation of) / = where M{x) = 

maxj=i...._„(i3a:)i and T = B*B. It yields that, for all 0 G R, 


/ \ /•o 

4 i 


'E 


r 1 JMtl2 


Li=i 


dt. 


Here {X G Ai} = {/ = *}, z = 1,..., n, and X* = e“‘X -|- \/l — Y, where Y is 
an independent copy of X, with corresponding maximum M^. This process shares 
the same hypercontractivity property as the Ornstein-Uhlenbeck process (cf. [7]). 
Indeed, assuming without loss of generality that B is invertible (otherwise restrict 
to a subspace of R"), define the semigroup {Qt)t>o by 


Qtf{x) = E [/ (e-^x + 


for / : R" —^ R smooth enough. Let Z be a standard Gaussian vector on R” so 
that Y = BZ in law. Thus, by defining g : R" —>■ R as g{x) = f{Bx), we have, for 
all X G R", 


Qtf{x) 


E 

E 

E 


/ (^e~*x+ \f\ 
f [e-^BB-'^x 
g {^e~*B~^x + 


- 

-k \/l - e-^^Bz'j 


^1 - e-2iZ 


) =Ptg{B ^x) 


Therefore, by the hypercontractive property of {Pt)t>o with the same exposants p 
and q as the Ornstein-Uhlenbeck semigroup, 

(2.3) E [|QJ(X)r]'/^ = \\Ptg\\, < \\g\\p = E [\f{X)ff^ , 


where || • ||r, for any r > 1, stands for the T’'-norm with respect to the standard 
Gaussian measure on R". 


















SUPERCONCENTRATION OF GAUSSIAN STATIONARY PROCESSES 


5 


Now, for every f > 0, with J* = argmax^^^ 


I = E 


= E 


n 




^0M„ 


e---n/2l JMi /2 

{XGA,}e l{XtGA,}e 


g0(M„+M*)/2 


= E 

= 


ui=i 

ge(M„+M*)/2p 


fe >0 


II* 

e(M„+M‘) /2 


Fz/tlj 


To ease the notation, denote T/jt by T and set F* = M„ + M^. Let ko = min{A: > 
0; ’’o < 2“^“^}. Cutting the preceding sum into two parts, we obtain 

ko oo 

I < 22-'=E 2-'=E 


fc=0 


k—kQ-\-l 


i{2-fc-i<r<2-fc} 


OeC(ro) 


^roE 


{2~fc-i<r<2~fc} 




{i,i*eD} 


k>0 


- toE 


JF*/2 


= 2 E ® 

D&C{ro) 

By the Cauchy-Schwarz inequality and Gaussian rotational invariance. 


E 


ffF*/2 


< E . 


On the other hand, by Holder’s inequality with ^ ^ = Ij 


E 


JF* 12^ 

e i{i.iteD} 


= E 


< E 


^epM„/2^ 


{/GO} 


E 


egM*/2-, 


1/9 


Next, by the hypercontractivity property (2.3), 


E 


^eqM*„/2^ 


{PeD} 


1/9 


< E 


'^0pM„/2^ 


{/GO} 


1/p 


provided that e^* = that isp = l + e ‘<2. As a consequence. 


E 


e0F*/2^ 


{/./‘GO} 


< E 


^0pM„/2^ 


l2/p 


{/GO} 


and by a further use of Holder’s inequality. 


E 


Combining the preceding with the assumptions, we get that 
I< (ro + Cp{rof-^) E[e®'^"] . 

Finally, 

Var < C ^ (^tq + E [e®“"] . 

The announced inequality (2.1) then follows since 

Jo log(l/p(/'o)) 
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To end the proof of Theorem 5 and produce exponential tail bounds, we combine 
(2.1) with the following lemma (see [13]). 

Lemma 6. Let Z be a random variable and K > 0. Assume that for every \0\ < 
(2.4) Var ^ ^ ^ ® • 


Then, 


P (|Z - E [Z] l>t) < 


for all t > 0, with c > 0 a numerieal constant. 

Remark. Such a lemma has been used in prior works [6, 8] in order to obtain 
exponential concentration tail bounds for first passage percolation models. 


□ 


3. Proofs of the main results 

3.1. Proof of Theorem 3. The task is to show that the hypothesis of Theorem 5 
are satisfied. For 0 < a < 1, choose vq = (j){[n°‘\) and C{ro) = {Hi, £> 2 , • ■ •} where 
Hi = (1,..., 2[n“J}, H 2 = {[n“J,..., 3[n“J} and so on, where [•] denote the 
integer part of a real number. 

It is easily seen that this covering satisfies the hypothesis of Theorem 5. Indeed, 
take i,j € {1 ,... ,n} such that i < j. By stationarity, Cov(Xi+i, Xj) = (f>[j — i). 
So if 4>{j — i)>rQ= <f){\n °‘\), since tp is non-increasing, we must have j — i < [n“J. 
By construction, any index i is at most in three different elements H of C(ro). So i 
belongs to some Di ^, H^^ and H^g with ii < i 2 < is and j belong to Dj ^, Hj^ and 
Hjg with ji < j 2 < js- Since j — i < n°‘ and the length of any H in C(ro) is 2[n“J, 
there exists s G {1,2,3} such that Di^ = Dj^ (draw a picture) and we can choose 
D = Di^ G C(ro). 

Clearly J2DeCiro) where C is a universal constant (say C = 3). To 

end the proof, we have to bound max£)gc(r(,) P(/ G D). Let H G C{ro), and to 
ease the notation, consider H = {1,..., 2[n“J} (the important feature is that H 
contains 2[n“J elements). Then 

2Ln“J 

P(J G D) = HI =^)- 

i=l 

By standard Gaussian concentration (see [13] or the appendix of [7]), for all i G D, 

P(j = i) = P(Xi = M„) 

< F{Xi >t) + P(M„ < t) 

The final step of the argument is to achieve a lower bound on E [M„]. To this task, 
we make use of Sudakov’s minoration (see [3] or [14]). Letting i,jGD,i^ j, 

E[{X,-X,f] = 2-2cp{\j-i\)>2{l-cP{l))=5 

since (p[\j — i|) < Hi). By the assumption on (()(1), (5 > 0, and thus by Sudakov’s 
minoration there exists c > 0 (independent of n) such that E [M„] > c^y^logn. As 
a consequence of the preceding, 

2 2 


P(J = i)< 


n' 




SUPERCONCENTRATION OF GAUSSIAN STATIONARY PROCESSES 


7 


Hence 


F{i eD)< 



4 

rF! 


where a is such that ry = e — a > 0, namely a < {c6)‘^l2 < 1. 

The hypotheses of Theorem 5 are therefore fulfilled with p{ro) < 4/ri^ concluding 
the proof of Theorem 3. 


3.2. Proof of Theorem 4. We again follow Chatterjee’s proof of variance bounds 
for Gaussian Euclidean fields (see Theorem 9.12 in [7]). For any Borel set A C 
set m{A) = E [M{A)] = E [sup^g^X^]. Under the hypothesis of the theorem, it is 
proved in [7] that for any H C such that N{A) > 1, 

ci{cl),d)^/logN{A) < m{A) < c2{4>,d)\/\ogN{A) 

where c\ = ci{4i,d) and C 2 = C 2 { 4 '-,d) are positive constants that depend only on 
the covariance (j) and the dimension d (and not on the subset A). Set then 

So = 

and assume that N{A) is large enough so that sq > 2. Let rg = (j){so). Take a 
maximal SQ-net of A ( i.e. a set of points that are mutually separated from each 
other by a distance strictly greater than sq and is maximal with respect to this 
property), and let C(ro) be the set of 2so-ball around the points in the net. In 
fact, C(ro) is a covering of A satisfying the condition of Theorem 5. With this 
construction, it is easily seen that J^DGCirg) ^ ^ 

max F{I e D) < jn ■ 

DGC(ro) 

The conclusion then directly follows from Theorem 5 with p{ro) < 1/N{A)^^'^’‘^\ 


4 . Other Gaussian models 

In [7], Chatterjee exhibits different models in order to illustrate the so-called 
superconcentration phenomenon. For each of them, he managed to reach a better 
variance bound than the one produced by standard Gaussian concentration. We 
present for some of these models an exponential version. 

Proposition 7. For any n > 2, let X = (Xi,...,X„) be a centered Gaussian 
vector with E [Xi] = 1 and E < e for alli,j = 1,..., n for some e > 0. Then 

F{\Mn - E[M„] I > t) < t > 0, 

where Kn = max(e, 1/logn). 

Proof. Take tq > e and define C(ro) = {{!},..., {X}}. It is easy to see that the 
covering C(ro) enters the setting of Theorem 5. Furthermore, J^D^Ciro) = 1 

and ma.xjj^c{ro) ^^(^ & D) < 1/n^ for some p > 0. The conclusion follows. □ 

As another instance, consider the Gaussian field on {—1,1}", n > 1, defined in 
the following way. Let Xi,..., X„ be independent identically distributed standard 
normals, and define / : {—1,1}"—?^Ras 

n 

fi<^) = ^XiCTi. 

2=1 
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Corollary 8. There exists S = . ,a^) € {—1,1}” for some N, such that, 

for all a ^ a' G S, \cr ■ a'\ < and 

P(|/-]E[/] I > i) < i>0. 

Proof. Chatterjee proved in [7] the existence of S' G {—1,1}” on which 
\a ■ a'\ < . Since, Cov(/((t), /(ct)) = a • a', it suffices to apply Proposition 7 to 

/|5 and e = . □ 


5. Statistical testing 

In this last section, we illustrate the preceding results on a statistical testing 
problem analyzed in [2]. There, the authors used standard Gaussian concentration 
to obtain an acceptance region for their test. Using some of the material devel¬ 
oped here, the conclusion may be reinforced in some instances in the form of a 
superconcentration bound. 

Following the framework and notation of [2] , we observe an n dimensional Gauss¬ 
ian vector X = {Xi ,..., X„) and raise the following hypothesis problem: 

• Under the null hypothesis Hq, the components of X are independent 
identically distributed standard normal random variables. Denote then by 
Pq and Eg respectively the underlying probability measure and expectation 
under Hq. 


• To describe the alternative hypothesis Hi, consider a class C = 
{Si,..., Sn} of N sets of indices such that Sk C {!,..., n} for all 
k = 1,..., N. Under Hi, there exists an S' € C such that 

X, has distribution | \ ^ 

\ Af{p,, 1) liiG S, 

where > 0 is a positive parameter. The components of X are independent 
under Hi as well. For any S G C, denote then by Ps and E 5 respectively 
the underlying probability measure and expectation under Hi. We will also 
assume that every S G C has the same cardinality |S| = K. 

We recall that a test is a binary-valued function / : K.” —>■ {0,1}. If f{X) = 0, 
the test accepts the null hypothesis, otherwise Hq is rejected. As in [2], consider 
the risk of a test / measured by 

R{f) = Po(/(A) = 1 )) + 1 ^ P5(/(A) = 0)). 

^ Sec 

The authors of [2] are interested in determining, or at least estimating, the value 
of /I under which the risk can be made small. Among others results, they used a 
test based on maxima, called the scan-test, for which they showed that 


f{X) = 1 


max Xs > uK + En 
Sec ^ 


maxAv 

Lsec 


where Xs = J2i^s some S G C. They obtain the following result. 


Proposition 9. The risk of the maximum test f satisfies R{f) < <5 whenever 


T > 


K 


■ En 


maxA.c 

Lsec 



Together with (1.5), this bound may be improved in accordance with the correct 
magnitude of the maximum of a standard Gaussian vector. 
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Proposition 10. For any n>2, the risk of the maximum test f satisfies R{f) < S 
whenever 




maxXs 

Lsgc 


1 6 2 

+ log - X - , 

5 Cy/K log N 


where c> 0 is a universal constant. 


Proof. We follow the proof of [2] to get that both, for every t > 0, 


^ofmaxXg > Eq 
V SgC 


max Xq 
L Sec 


+ 


and, for any S € C, 

Vs(criaxXs'<fJ'^^~t) ^ P5rnaxXs< < 

< 3g-ct-v^log(Ar)/A' 


max Xq! 
L S'eC 


-t\ 


since, under iJi, for a fixed S & C, fiK = E 5 [X 5 ] < Eg [maxs/gc ^S'] and Xs has 
the same law as any Xs'. Set then t such that 


2t = yiK — E 


max Xq 
L Sec 


which is positive according to the hypothesis. With the previous inequalities, we 
obtain a bound on the risk 

R[f) < 6e"‘=‘Vi°8(^)/-^ 


which may then be turned into R{f) < <5 as in the statement. 


□ 


The new bound of Proposition 10 is as good as the one of Proposition 9 when 
5 ~ l/N"^ for some a > 0. It is better than Proposition 9 when ^ ~ l/log“(A/") 
with a > 0. Nevertheless, Proposition 9 is better than Proposition 10 when 
5 ~ , for a > 0. However, it might not be relevant to consider such small risk. 


Aknowledgment. I thank my Ph.D advisor M. Ledoux for introducing this 
problem to me and for fruitful discussions. 
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